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ABSTRACT 

We explore the striking mathematical connections that ex- 
ist between market scoring rules, cost function based pre- 
diction markets, and no-regret learning. We show that any 
cost function based prediction market can be interpreted as 
an algorithm for the commonly studied problem of learning 
from expert advice by equating trades made in the mar- 
ket with losses observed by the learning algorithm. If the 
loss of the market organizer is bounded, this bound can be 
used to derive an 0(y/T) regret bound for the corresponding 
learning algorithm. We then show that the class of markets 
with convex cost functions exactly corresponds to the class 
of Follow the Regularized Leader learning algorithms, with 
the choice of a cost function in the market corresponding to 
the choice of a regularizer in the learning problem. Finally, 
we show an equivalence between market scoring rules and 
prediction markets with convex cost functions. This implies 
that market scoring rules can also be interpreted naturally 
as Follow the Regularized Leader algorithms, and may be of 
independent interest. These connections provide new insight 
into how it is that commonly studied markets, such as the 
Logarithmic Market Scoring Rule, can aggregate opinions 
into accurate estimates of the likelihood of future events. 

1. INTRODUCTION 

Imagine you are interested in learning an accurate esti- 
mate of the probability that the United States unemploy- 
ment rate for a particular month will fall below 10%. You 
could choose to spend hours digging through news arti- 
cles, reading financial reports, and weighing various opin- 
ions against each other, eventually coming up with a rea- 
sonably informed estimate. However, you could potentially 
save yourself a lot of hassle (and obtain a better estimate!) 
by appealing to the wisdom of crowds. 

A prediction market is a financial market designed for 
information aggregation. For example, in a cost function 
based prediction market [fj, the organizer (or market maker) 
trades a set of securities corresponding to each potential out- 
come of an event. The market maker might offer a security 
that pays $1 if and only if the United States unemployment 
rate for January 2010 is above 10%. A risk neutral trader 
who believes that the true probability that the unemploy- 
ment rate will be above 10% is p should be willing to buy 
a share of this security at any price below $p. Similarly, 
he should be willing to sell a share of this security at any 
price above $p. For this reason, the current market price 
of this security can be viewed as the population's collective 
estimate of how likely it is that the unemployment rate will 
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be above 10%. 

These estimates have proved quite accurate in practice 
in a wide variety of domains. (See Ledyard et al. (2tJ for 
an impressive assortment of examples.) The theory of ra- 
tional expectations equilibria offers some insight into why 
prediction markets in general should converge to accurate 
prices, but is plagued by strong assumptions and no-trade 
theorems [3l|. Furthermore, this theory says nothing of 
why particular prediction market mechanisms, such as Han- 
son's increasingly popular Logarithmic Market Scoring Rule 
(LMSR) [ll| Hq[ . might produce more accurate estimates 
than others in practice. In this work, we aim to provide ad- 
ditional insight into the learning power of particular market 
mechanisms by highlighting the deep mathematical connec- 
tions between prediction markets and no-regret learning. 

It should come as no surprise that there is a connection 
between prediction markets and learning. The theories of 
markets and learning are built upon many of the same funda- 
mental concepts, such as proper scoring rules (called proper 
losses in the learning community) and Bregman divergences. 
To our knowledge, Chen et al. [g| were the first to formally 
demonstrate a connection, showing that the standard Ran- 
domized Weighted Majority regret bound 0] can be used as 
a starting point to rederive the well-known bound on the 
worst-case loss of a LMSR marker maker. (They went on 
to show that PermELearn, an extension of Weighted Major- 
ity to permutation learning [53], can be used to efficiently 
run LMSR over combinatorial outcome spaces for betting on 
rankings.) As we show in Section[4] the converse is also true; 
the Weighted Majority regret bound can be derived directly 
from the bound on the worst-case loss of a market maker 
using LMSR. However, the connection goes much deeper. 

In Section 0] we show how any cost function based pre- 
diction market with bounded loss can be interpreted as a 
no-regret learning algorithm. Furthermore, if the loss of the 
market maker is bounded, this bound can be used to derive 
an OiyT) regret bound for the corresponding learning algo- 
rithm. The key ides is to view the trades made in the market 
as losses observed by the learning algorithm. We can then 
think of the market maker as learning a probability distri- 
bution over outcomes by treating each observed trade as a 
training instance. 

In Section [S] we go on to show that the class of con- 
vex cost function based markets exactly corresponds to 
the class of Follow the Regularized Leader learning algo- 
rithms [34l . Hll . Il7l | in which weights are chosen at each time 
step to minimize a combination of empirical loss and a con- 
vex regularization term. This allows us to interpret the se- 



lection of a cost function for the market as the selection 
of a regularizer for the learning problem. Furthermore, we 
prove an equivalence between another common class of pre- 
diction markets, market scoring rules, and convex cost func- 
tion based marketsQ which immediately implies that market 
scoring rules can be interpreted as Follow the Regularized 
Leader algorithms too. These connections provide insight 
into why it is that prediction markets tend to yield such 
accurate estimates in practice. 

Before describing our results in more detail, we review the 
relevant concepts and results from the literature on predic- 
tion markets and no-regret learning in Sections [3] and 

2. PREDICTION MARKETS 

In recent years, a variety of compelling prediction mar- 
ket mechanisms have been proposed and studied, includ- 
ing standard call market mechanisms and Pennock's dy- 
namic parimutuel markets [3(j. In this work we focus on 
two broad classes of mechanisms: Hanson's market scoring 
rules [iH Oil and cost function based prediction markets as 
described in Chen and Pennock Jj|. We also briefly discuss 
the related class of Sequential Convex Parimutuel Mecha- 
nisms [lj in Section \5. 41 

2.1 Market Scoring Rules 

Scoring rules have long been used in the evaluation of 
probabilistic forecasts. In the context of prediction markets 
and elicitation, scoring rules are used to encourage individ- 
uals to make careful assessments and truthfully report their 
beliefs [33l . [Til . . In the context of machine learning, scor- 
ing rules are used as loss functions to evaluate and compare 
the performance of different algorithms H3| • 

Formally, let {1, ■ • • , N} be a set of mutually exclusive and 
exhaustive outcomes of a future event. A scoring rule s maps 
a probability distribution p to a score Si(p) for each outcome 
i, with Si(p) taking values in the extended real line [-co, oo]. 
Intuitively, this score represents the reward of a forecaster 
might receive for predicting the distribution p if the outcome 
turns out to be i. A scoring rule is said to be regular relative 
to the probability simplex Ajv if £/i=i PiSi(p') G [— oo,oo) 
for all p, p' £ Ajv, with X/-=iP* s <(p) (—00,00). This 
implies that Si(p) is finite whenever pi > 0. A scoring 
rule is said to be proper if a risk-neutral forecaster who be- 
lieves the true distribution over outcomes to be p has no 
incentive to report any alternate distribution p' , that is, if 
YliLiPi s i(P) > YliLiPi s i(p') f° r al1 distributions p' . The 
rule is strictly proper if this inequality holds with equality 
only when p — p'. 

Two examples of regular, strictly proper scoring rules 
commonly used in both elicitation and in machine learning 
are the the quadratic scoring rule 0: 

Si (p) = CH + b ^2pi - W 

and the logarithmic scoring rule [l3j |: 

Si(p) = cn + 61og(pi) (2) 

with arbitrary parameters 01, • • ■ , ajv and parameter b > 0. 
The uses and properties of scoring rules are too extensive 

1 A similar but weaker correspondence between market scor- 
ing rules and cost function based markets was discussed in 
Chen and Pennock [5| and Agrawal et al. [J]. 



to cover in detail here. For a nice survey, see Gneiting and 
Raftery 

Market scoring rules were developed by Hanson [l5l.[lr3| as 
a method of using scoring rules to pool opinions from many 
different forecasters. Market scoring rules are sequentially 
shared scoring rules. Formally, the market maintains a cur- 
rent probability distribution p. At any time, a trader can 
enter the market and change this distribution to an arbitrary 
distribution p ' of her choice0 If the outcome turns out to be 
i, she receives a (possibly negative) payoff of Sj(p") — Si(p). 
For example, in the popular Logarithmic Market Scoring 
Rule (LMSR), which is based on the logarithmic scoring 
rule in Equation [2] a trader who changes the distribution 
from p to p' receives a payoff of blog(p'i/pi). 

Since the trader has no control over p, a myopic trader who 
believes the true distribution to be r maximizes her expected 
payoff by maximizing 2i r ! s !(p")- Thus if s is a strictly 
proper scoring rule, traders have an incentive to change the 
market's distribution to match their true beliefs. The idea 
is that if traders update their own beliefs over time based on 
market activity, the market's distribution should eventually 
converge to the collective beliefs of the population. 

Each trader in a market scoring rule is essentially responsi- 
ble for paying the previous trader's score. Thus the market 
maker is responsible only for paying the score of the final 
trader. Let po be the initial probability distribution of the 
market. The worst case loss of the market maker is then 

max max (si(p) — s;(p*o)) • 
ie{i,- ,N}peA N 

The worst case loss of the market maker running an LMSR 
initialized to the uniform distribution is b\ogN. 

Note that the parameters 01, ■ • ■ ,ajv of the logarithmic 
scoring rule do not affect either the payoff of traders or the 
loss of the market maker in the LMSR. For simplicity, in 
the remainder of this paper when discussing the LMSR we 
assume that 04 = for all i. 

2.2 Cost Function Based Markets 

As before, let {1, ••■ ,N} be a set of mutually exclusive 
and exhaustive outcomes of an event. In a cost function 
based market, a market maker offers a security correspond- 
ing to each outcome i. The security associated with outcome 
i pays off $1 if i happens, and $0 otherwise^ 

Different mechanisms can be used to determine how these 
securities are priced. Each mechanism is specified using a 
differentiable cost function C : M. N — > R. This cost func- 
tion is simply a potential function describing the amount of 
money currently wagered in the market as a function of the 
quantity of shares purchased. If qt is the number of shares of 
security i currently held by traders, and a trader would like 
to purchase shares of each security (where n could be zero 
or even negative, representing the sale of shares), the trader 
must pay C(q + r) — C(q) to the market maker. The instan- 
taneous price of security i (that is, the price per share of an 
infinitely small number of shares) is then pi — dC(q) / dqi. 

2 While p' may be arbitrary, in some market scoring rules, 
such as the LMSR, distributions that place a weight of on 
any outcome are not allowed because it requires the trader to 
pay infinite amount of money if the outcome with reported 
probability actually happens. 

3 The dynamic parimutuel market falls outside this frame- 
work since the winning payoff depends on future trades. 



We say that a cost function is valid if the associated prices 
satisfy two simple conditions: 

1. For every i 6 {1, • • • , N} and every q € ~$& N , Pi(q) > 0. 

2. For every q £ R w , E l= =iP<® = 1 ■ 

The first condition ensures that the price of a security is 
never negative. If the current price of the security associated 
with an outcome i were negative, a trader could purchase 
shares of this security at a guaranteed profit. The second 
condition ensures that the prices of all securities sum to 1. 
If the prices summed to something less than (respectively, 
greater than) 1, then a trader could purchase (respectively, 
sell) small equal quantities of each security for a guaranteed 
profit. Together, these conditions ensure that there are no 
arbitrage opportunities in the market. 

These conditions also ensure that the current prices can 
always be viewed as a valid probability distribution over the 
outcome space. In fact, these prices represent the market's 
current estimate of the probability that outcome i will occur. 

The following theorem gives sufficient and necessary con- 
ditions for the cost function C to be valid. While these prop- 
erties of cost functions have been discussed elsewhere [a, [j| , 
the fact that they are both sufficient and necessary for any 
valid cost function C is important for our later analysis. As 
such, we state the full proof here for completeness. 

Theorem 1. A cost function C is valid if and only if it 
satisfies the following three properties: 

1. Differentiability: The partial derivatives 



dC{q) / dqi exist for all <f £ K and i £ {1, 



,N}. 



2. Increasing Monotonicity: For any q and q" , if q > 
q', then C(q) > £7(<f ). 

3. Positive Translation Invariance: For any q and 
any constant k, C(q + kl) — C'(q) + k. 

Proof. Differentiability is necessary and sufficient for the 
price functions to be well-defined at all points. It is easy 
to see that requiring the cost function to be monotonic is 
equivalent to requiring that Pi(q) > for all i and q. We 
will show that requiring positive translation invariance is 
equivalent to requiring that the prices always sum to one. 

First, assume that EiliP*® = 1 for all q. For any fixed 
value of q, define u = u{a) — q + al and let Ui be the ith 
component of u. Then for any k, 



C(q + kl) - C(q) 



dC(q + al) 



da 



da 



k N 

E 

i=l 
k N 



dC(u) duj 
dui da 



da 



~^^Pi(u)da = k 



This is precisely translation invariance. 

Now assume instead that positive translation invariance 
holds. Fix any arbitrary q' and k and define q = q' + kl. 
Notice that by setting q" and k appropriately, we can make 
q take on any arbitrary values. We have, 



dC{q) 
dk 



E 



8C(q) dqi 
dqi dk 



E Pi ®- 



By translation invariance, C(q" + kl) — C'(q') + k. Thus, 
dC(q) d(C(q') + k) 



dk 



Ok 



= 1. 



Combining the two equations, we have Ei=iPi® = 1. □ 

One quantity that is useful for comparing different market 
mechanisms is the worst-case loss of the market maker, 



max max qi 

q£R N ,N} 



(C(q) - £7(0)) 



This is simply the difference between the maximum amount 
that the market maker might have to pay the winners and 
the amount of money collected by the market maker. 

The Logarithmic Market Scoring Rule described above 
can be specified as a cost function based prediction mar- 
ket [IEh]. Then cost function of the LMSR is 



C7(g) = blog]Te 



lilb 



and the corresponding prices are 



Pi(q) 



dC(g) 
dqi 



This formulation is equivalent to the market scoring rule for- 
mulation in the sense that a trader who changes the market 
probabilities from f to f' in the MSR formulation receives 
the same payoff for every outcome i as a trader who changes 
the quantity vectors from any q to q' such that p(q) = f and 
p(q") = r' in the cost function formulation. 

3. LEARNING FROM EXPERT ADVICE 

We now briefly review the problem of learning from expert 
advice. In this framework, an algorithm makes a sequence 
of predictions based on the advice of a set of experts and 
receives a corresponding sequence of ZossesQ The goal of the 
algorithm is to achieve a cumulative loss that is "almost as 
low" as the cumulative loss of the best performing expert in 
hindsight. No statistical assumptions are made about these 
losses. Indeed, algorithms are expected to perform well even 
if the sequence of losses is chosen by an adversary. 

Formally, at every time step t € {1, • • • ,T}, every expert 
i £ {1, ■ • ■ , N} receives a loss £i t t G [0, 1]. The cumulative 
loss of expert i at time T is then defined as L^t = X^t=i ^»>*- 
An algorithm A maintains a weight uii : t for each expert 
i at time t, where £"=1 Wi - t = 1- These weights can be 
viewed as a distribution over the experts. The algorithm 
then receives its own instantaneous loss l^.t = Y^i=i w i,t^i,t, 
which can be interpreted as the expected loss the algo- 
rithm would receive if it always chose an expert to fol- 
low according to the current distribution. The cumulative 
loss of A up to time T is defined in the natural way as 

l A,t = Ef=i = Ef=i Er=i Wi,d i<t . 

It is unreasonable to expect the algorithm to achieve a 
small cumulative loss if none of the experts perform well. 
As such, it is typical to measure the performance of an al- 
gorithm in terms of its regret, defined to be the difference 



4 This framework could be formalized equally well in terms 
of gams, but losses are more common in the literature. 



between the cumulative loss of the algorithm and the loss of 
the best performing expert, that is, 

La.t - min L iyT - 
ie{i,--- ,n} 

An algorithm is said to have no regret if the average per 
time step regret approaches as T approaches infinity. 

The popular Randomized Weighted Majority (WM) al- 
gorithm [2Sl . 0| is an example of a no-regret algorithm. 
Weighted Majority uses weights 



Wit 



3=1 



where r\ > is a tunable parameter known as the learning 
rate. It is well known that the regret of WM after T trials 
can be bounded as 



J WM(v),T 



log A?" 

min L iT <r)T + 
!g{l,---,Ar} ' r\ 



When T is known in advance, setting rj — ^/log N/T yields 
the standard 0{yjT log N) regret bound. 

It has been shown that the weights chosen by Weighted 
Majority are precisely those that minimize a combination of 
empirical loss and an entropic regularization term [24l |25| . 
l2(il |. More specifically, the weights at time t are precisely 
those that minimize 



y^^WjLig 



-H(w) 



among all w £ Ajv, where H is the entropy. This makes 
Weighted Majority an example of broader class of algorithms 
collectively known as Follow the Regularized Leader algo- 
rithms [3J, [Isl . [I?} • This class of algorithms grew out of the 
following fundamental insight of Kalai and Vempala [22| . 

Consider first the aptly named Follow the Leader al- 
gorithm, which chooses weights at time t to minimize 
Xlili This algorithm simply places all of its 

weight on the single expert (or set of experts) with the best 
performance on previous examples. As such, this algorithm 
can be highly unstable, dramatically changing its weights 
from one time step to the next. It is easy to see that Follow 
the Leader suffers f2(T) regret in the worst case when the 
best expert changes frequently. For example, if there are 
only two experts with losses starting at (1/2,0) and then 
alternating (0, 1) , (1, 0) , (0, 1) , (1, 0) , • ■ • , then FTL places 
a weight of 1 on the losing expert at every point in time. 

To overcome this instability, Kalai and Vempala |22| sug- 
gested adding a random perturbation to the empirical loss 
of each expert ,and choosing the expert that minimizes this 
perturbed losslfj However, in general this perturbation need 
not be random. Instead of adding a random perturbation, 
it is possible to gain the necessary stability by adding a reg- 
ularizer TZ and choosing weights to minimize 



w i>t L 



i,t-i + -TZ(w t ) 



(3) 



This Follow the Regularized Leader (FTRL) approach gets 
around the instability of FTL and guarantees low regret for 
a wide variety of regularizers, as evidenced by the following 
bound of Hazan and Kale UM . 



5 A very similar algorithm was orig inally developed and an- 
alyzed by Hannan in the 1950s [141] ■ 



Lemma 1 (Hazan and Kale For any regular- 

izer TZ, the regret of FTRL can be bounded as 



J FTRL{1Z,r]),T 



min Li 

i£{l,— ,N} 



< V ii,t(w itt - wi, t+1 ) + - (TZ(w T ) - TZ{w )) 

Z / Z / n 

t— 1 ,-—1 ' 



This lemma quantifies the trade-off that must be consid- 
ered when choosing a regularizer. If the range of the regular- 
izer is too small, the weights will change dramatically from 
one round to the next, and the first term in the bound will 
be large. On the other hand, if the range of the regularizer 
is too big, the weights that are chosen will be too far from 
the true loss minimizers and the second term will blow up. 

It is generally assumed that the regularizer TZ is strictly 
convex. This assumption ensures that Equation [3] has a 
unique minimizer and that this minimizer can be computed 
efficiently. Hazan [l7| shows that if TZ is strictly convex then 
it is possible to achieve a regret of 0(VT). In particular, 
by optimizing r\ appropriately the regret bound in Lemma[T] 
can be upper bounded by 



2 /2A max {TZ(w) - TZ{w'))T (4) 
where A = max K|D1 p jl36Ajv £ T [V 2 TL(w)]- 1 e. 

4. INTERPRETING PREDICTION MAR- 
KETS AS NO-REGRET LEARNERS 

With this foundation in place, we are ready to describe 
how any bounded loss market maker can be interpreted as 
an algorithm for learning from expert advice. The key idea 
is to equate the trades made in the market with the losses 
observed by the learning algorithm. We can then view the 
market maker as essentially learning a probability distri- 
bution over outcomes by treating each observed trade as a 
training instance. 

More formally, consider any cost function based market 
maker with instantaneous price functions pi for each out- 
come i. We convert such a market maker to an algorithm 
for learning from expert advice by setting the weight of ex- 
pert i at time t using 



m,t = Pi(-eLt-i), 



(5) 



where e > is a tunable parameter and Lt-i = 
{Li,t-i, - • ■ ,£jv.t-i) is the vector of cumulative losses at 
time t— 1. In other words, the weight on expert i at time t in 
the learning algorithm is the instantaneous price of security 
i in the market when —eLj t t-i shares have been purchased 
(or eLj jt _i shares have been sold) of each security j. We 
discuss the role of the parameter e in more detail below. 

First note that for any valid cost function based predic- 
tion market, setting the weights as in Equation [S] entails 
valid expert learning algorithm. Since the prices of any 
valid prediction market must be non-negative and sum to 
one, the weights of the resulting algorithm are guaranteed 
to satisfy these properties too. Furthermore, the weights are 
a function of only the past losses of each expert, which the 
algorithm is permitted to observe. 

Below we show that applying this conversion to any 
bounded-loss market maker with slowly changing prices 



yields a learning algorithm with 0(yT) regret. The quality 
of the regret bound obtained depends on the trade-off be- 
tween market maker loss and how quickly the prices change. 
We then show how this bound can be used to rederive the 
standard regret bound of Weighted Majority, the converse 
of the result of Chen et al. [(J. 

4.1 A Bound on Regret 

In order to derive a regret bound for the learning algo- 
rithm defined in Equation [S] it is necessary to make some 
restrictions on how quickly the prices in the market change. 
If market prices change too quickly, the resulting learning 
algorithm will be unstable and will suffer high worst-case 
regret, as was the case with the naive Follow The Leader 
algorithm described in Section [3] To capture this idea, we 
introduce the notion of ^-stability, defined as follows. 

Definition 1. We say that a set of price functions 
p is 0-stable for a constant (f> if pt is continuous 
and piecewise differentiable for all i £ {1, • • • ,N} and 
Eti £f=i \ D ^(P)\ < 4> M all t, where 



dpi(q) 
dqj | 





if dP g^ is defined at t, 
otherwise. 



Defining ^-stability in terms of the Dij allows us to quan- 
tify how slowly the prices change even when the price func- 
tions are not differentiable at all points. We can then derive 
a regret bound for the resulting learning algorithm using the 
following simple lemma. This lemma states that when the 
quantity vector in the market is q, if the price functions are 
^-stable, then the amount of money that the market maker 
would collect for the purchase of a small quantity r; of each 
security i is not too far from the amount that the market 
maker would have collected had he instead priced the shares 
according to the fixed price p(q). 



Lemma 2. 
stable prices, 
such that Irvl 



Let C be any valid cost function yie 



For any e > 0, 
< e for i 6 {1, ■ 



any q G 
■■ ,N}, 



and any f € 



The proof is in Appendix [A] 

With this lemma in place, we are ready to derive the re- 
gret bound. In the following theorem, it is assumed that T 
is known a priori and therefore can be used to set e. If T 
is not known in advance, a standard "doubling trick" can be 
applied 0|. The idea behind the doubling trick is to par- 
tition time into periods of exponentially increasing length, 
restarting the algorithm each period. This leads to similar 
bounds with only an extra factor of log(T). 

Theorem 2. Let C be any valid cost function yielding in- 
stable prices. Let B be a bound on the worst-case loss of 
the market maker mechanism associated with C . Let A. be 
the expert learning algorithm with weights as in Equation\5\ 
with e — \J2B I (4>T) . Then for any sequence of expert losses 
ii.t £ [0, 1] over T time steps, 



La: 



min L iT < j2B<j)T 
ie{i,-,N} 



Proof. By setting the weights as in Equation [5] we are 
essentially simulating a market over N outcomes. Let n t t 
denote the number of shares of outcome i purchased at time 
step t in this simulated market, and denote by ft the vector 
of these quantities for all i. Note that r^t is completely in 
our control since we are simply simulating a market, thus 
we can choose to set n,t = —eti.t for all i and t. We have 
that ri.t £ [— e, 0] for all i and t since li t t £ [0,1]. Let 
q%,t = Y2t'=i r »>*' b e t ne total number of outstanding shares 
of security i after time t, with qt denoting the vector over all 
i. The weight assigned to expert i at round t of the learning 
algorithm corresponds to the instantaneous price of security 
i in the simulated market immediately before round t, that 
is, Wi,t — Pi(-eLt-i) =Pi(q t -i). 

By the definition of worst-case market maker loss, 
max; qi t t — (C(qt) — C(0)) < B. It is easy to see that we 
can rewrite the left-hand side of this equation to obtain 

T t 

max V^TfCffl-CIMia. 
From Lemma [21 this gives us that 

ie ^ m E - E ^2PM-I)ri,t + — )<B. 

t=l t = l \i = l / 

Substituting pi(q t _i) = w iti and r ht = -e£ i<t , we get 

T t N 



max V {-e£i, t ) - VV^.t (-e£ M ) < B + 

iG{l,- ,N} t— 1 jL^, jL^, 'l — 



e 2 (j>T 



and so 

t N T 

La.t — min L i T = w itt ii,t — min l i t 

»€{1,— ,N\ ^— ' ^— ' i ^— ' 

t=l i=l t-1 

~ e 2 



Setting e = ■ s /2B/(<j)T) yields the bound. 



□ 



4.2 Rederiving the Weighted Majority Bound 

Chen et al. [f| showed that the Weighted Majority regret 
bound can be used as a starting point to rederive the worst 
case loss of 6 log N of an LMSR market maker. Here we show 
that the converse is also true; by applying Theorem [2] we 
can rederive the Weighted Majority bound from the bounded 
market maker loss of LMSR. 

In order to apply Theorem [21 we must provide a bound 
on how quickly LMSR prices can change. This is given in 
the following lemma, the proof of which is in Appendix IB1 

Lemma 3. Let p be the pricing function of a LMSR with 
parameter b > 0. Then 



EE 

i=l j=\ 



dpi(q) 



dqj 



Using Equation [5] to transform the LMSR into a learning 
algorithm, we end up with weights 

„-eL,_ t _ 1 /b 



lb 



Setting e = s /Wff$F) = b^logN/T, we see that these 
weights are equivalent to those use d by Wei ghted Majority 
with the learning rate r\ = e/b = \/log N/T. As mentioned 
above, this is the optimal setting of r\. Notice that these 
weights do not depend on the value of the parameter b in 
the prediction market. 

We can now apply Theorem [5] to rederive the standard 
Weighted Majority regret bound stated in Section [3] In 
particular, setting B — blogN and <j> = 2/6, we get that 
when rj = y/]og(N)/T, 



Lemma 4. A function C is a valid convex cost function 
if and only if it is differentiable and can be represented as 



Lwm t — min LiT < 2-v/T log N 
ie{i,— ,jv} 



5. CONNECTIONS BETWEEN MARKET 
SCORING RULES, COST FUNCTIONS, 
AND REGULARIZATION 

In this section, we establish the formal connections among 
market scoring rules, cost function based markets, and the 
class of Follow the Regularized Leaders algorithms. We start 
with a representation theorem for cost function based mar- 
kets, which is crucial in our later analysis. 

5.1 A Representation Theorem for Convex 
Cost Functions 

In this section we show a representation theorem for con- 
vex cost functions. The proof of this theorem relies on the 
connection between convex cost functions and a class of func- 
tions known in the finance literature as convex risk measures, 
which was first noted by Agrawal et al. 0]. Convex risk mea- 
sures were originally introduced by Follmer and Schied @| 
to model different attitudes towards risk in financial mar- 
kets. A risk measure p can be viewed as a mapping from 
a vector of returns (corresponding to each possible outcome 
of an event) to a real number. The interpretation is that a 
vector of returns x is "preferred to" the vector x" under a 
risk measure p if and only if p(x) < p(x'). 

Formally, a function p is a convex risk measure if it satis- 
fies the following three properties: 

1. Convexity: p(x) is a convex function of x. 



C(q) = sup I Vftft - a(f) 



(6) 



for a convex and lower semi-continuous function a. Further- 
more, for any quantity vector q, the price vector p(q) corre- 
sponding to C is the distribution p maximizing Y2%=i PiQi ~ 
a(p). 

PROOF. Consider any differentiable function C : M. N — > 
R. Let p(q) = C(—q). Clearly by definition, p satisfies de- 
creasing monotonicity if and only if C satisfies increasing 
monotonicity, and p satisfies negative translation invariance 
if and only if C satisfies positive translation invariance. Fur- 
thermore, p is convex if and only if C is convex. By The- 
orem [1] this implies that C is a valid convex cost function 
if and only if p is a convex risk measure. The first half of 
the lemma then follows immediately from the representation 
theorem of Follmer and Schied @| . 

Now, because a(p) is guaranteed to be convex, X^i=i PiQi~ 
ct(p) is a concave function of p. The constraints Ysi=iP-> ~ 1 
and pi > define a closed convex feasible set. Thus, the 
problem of maximizing Y^^iPiQi ~ a (p) with respect to 
p has a global optimal solution and first-order KKT con- 
ditions are both necessary and sufficient. Let p*(q) de- 
note an optimal p for this optimization problem. Then, 

= Z^i=iP*(C* _ "(P *(?))■ % the envelope theo- 
rem [2!|, if C{q) is differentiable, we have that for any i, 
Pi(o) = dC(q) /dqi = pi(q). Thus the market prices are pre- 
cisely those which maximize the inner expression of the cost 
function. □ 



Furthermore, by a version of the envelope theorem [23| , to 
ensure that C is differentiable, it is sufficient to show that 
a is strictly convex and differentiable. 



2. Decreasing Monotonicity: 
x > x" , then p(x) < p(x'). 



For any x and x" , if 



3. Negative Translation Invariance: For any x and 
value k, p(x + kl) = p(x) — k. 

The financial interpretations of these properties are not 
important in our setting. More interesting for us is that 
Follmer and Schied [i| provide a representation theorem that 
states that a function p is a convex risk measure if and only 
if it can be represented as 



p{x) 



sup 

pGA N 



^PiXi - a(p) 



where a : An — > (— oo,oo] is a convex, lower semi- 
continuous function referred to as a penalty function. This 
fact is useful because it allows us to obtain the following re- 
sult, which was alluded to informally by Agrawal et al. 
The full proof is included here for completeness. 



Corollary 1. A function C is a valid convex cost func- 
tion if it can be represented as in Equation for a strictly 
convex and differentiable function a. For any q, the price 
vector p(q) is the distribution p maximizing JZiLi Pi°i~ a {p) ■ 



The ability to represent any valid cost function in this 
form allows us to define a bound on the worst-case loss of 
the market maker in terms of the penalty function of the 
corresponding convex risk measure. 



Lemma 5. The worst-case loss of the market maker de- 
fined by the cost function in Equation^ is no more than 



sup 

p'eAj, 



(a(p) - a(p)) 



Proof. The worst-case loss of the market maker is 



sider cost functions derived from scoring rules s by setting 



max [ max qi — C{q) ) + C(0) 

( ( N 

= max I max q 4 — sup I > piqi — a(p) 

?eE« ^ 6{ i,..,jv} jj eAjv 

+ sup (-a(p')) 



' / N 

max qi — sup 7 PiQi — sup (a(p)) 
ie{i,-,JV} \peA N frt peA N 

— max ( max qt — max (ft J + sup (a(p)) 



< max 

+ sup (-a(p')) 

p'GAjv 



max qi — iiidx yi I -\- a up 



+ sup {-a( 
p'GAjv 

= sup (a(p) - a{p ')) 

P,P 'GAjv 



The inequality follows from the fact that for any func 
tions / and g over any domain X, sup x£X (f(x) — g(x)) > 



□ 



5.2 Convex Cost Functions and Market Scor- 
ing Rules 

As described in Section[2] the Logarithmic Market Scoring 
Rule market maker can be defined as either a market scor- 
ing rule or a cost function based market. The LMSR is not 
unique in this regard. As we show in this section, any reg- 
ular, strictly proper market scoring rule with differentiable 
scoring functions can be represented as a cost function based 
market. Likewise, any convex cost function satisfying a few 
mild conditions corresponds to a market scoring rule. As 
long as the market probabilities are nonzero, the market 
scoring rule and corresponding cost function based market 
are equivalent. More precisely, a trader who changes the 
market probabilities from r to r ' in the market scoring rule 
is guaranteed to receive the same payoff for every outcome i 
as a trader who changes the quantity vectors from any q to 
q' such that p(q) = r and p(q') = r' in the cost function for- 
mulation as long as every component of r and f" is nonzero. 
Moreover, any price vector that is achievable in the market 
scoring rule (that is, any p for which s;(p) is finite for all i) 
is achievable by the cost function based market. 

The fact that there exists a correspondence between cer- 
tain market scoring rules and certain cost function based 
markets was noted by Chen and Pennock They pointed 
out that the MSR with scoring function s and the cost func- 
tion based market with cost function C are equivalent if for 
all q and all outcomes i, C(q) — qi — Si(p). However, they 
did not provide any guarantees about the circumstances un- 
der which this condition can be satisfied. Agrawal et al. [IJ 
also made use of the equivalence between markets when this 
strong condition holds. Our result gives very general pre- 
cise conditions under which an MSR is equivalent to a cost 
function based market. 

Recall from Lemma[4]that any convex cost function C can 

be represented as C(q) = sup- gAw (j2f =1 Piqi - at(jf)\ for 
a convex function a. Let etc denote the function a corre- 
sponding to the cost function C. In the following, we con- 



(7) 



and scoring rules derived from convex cost functions with 



Si{p) = a c {p) - 



dac(p) da c (p) 
Pj + 



3=1 



dp 3 



dpi 



(8) 



We show that there is a mapping between a mildly restricted 
class of convex cost function based markets and a mildly 
restricted class of strictly proper market scoring rules such 
that for every pair in the mapping, Equations [7] and [8] both 
hold. Furthermore, we show that the markets satisfying 
these equations are equivalent in the sense described above. 

Theorem 3. There is a one-to-one and onto mapping be- 
tween the set of convex cost function based markets with 
strictly convex and differentiable potential functions ac and 
the class of strictly proper, regular market scoring rules with 
differentiable scoring functions s such that for each pair in 
the mapping, Equations^ and\E\ hold. 

Furthermore, each pair of markets in this mapping are 
equivalent when prices for all outcomes are positive, that 
is, the profit of a trade is the same in the two markets if 
the trade starts with the same market prices and results in 
the same market prices and the prices for all outcomes are 
positive before and after the trade. Additionally, every price 
vector p achievable in the market scoring rule is achievable 
in the cost function based market. 

Proof. We first show that the function ac in Equation]?] 
is strictly convex and differentiable and the scoring rule in 
Equation [8] is regular, strictly proper and differentiable. We 
then show that Equations [7] and [S] are equivalent. Finally, 
we show the equivalence between the two markets. 

Consider the function ac in Equation [7J Since we have 
assumed that Si is differentiable for all i, ac is differentiable 
too. Additionally, it is known that a scoring rule is strictly 
proper only if its expected value is strictly convex [l2| , so 
ac is strictly convex. 

Consider the scoring rule defined in Equation [8] By The- 
orem 1 of Gneiting and Raftery pjj], a regular scoring rule 
Si(p) is strictly proper if and only if there exists a strictly 
convex function G(p) such that 

N 

Si (p) = G(p)-J2 Pi &i GO + 6 * & > ( 9 ) 

3=1 

where Gj (p) is any subderivative of G with respect to Pj (if 
G is differentiable, Gj = dG(p) /dpj). This immediately im- 
plies that the scoring rule defined in Equation [S] is a regular 
strictly proper scoring rule since a(p) is strictly convex. We 
will see below that Si is also differentiable. 

It is easy to see that Equation [8] implies Equation [Jj Sup- 
pose Equation [8] holds. Then 



da c {p) da c (p) 



YpMp) = £)Pi(ao(g)-X) dp, f Pj + 



3=1 



dpi 



= ac(p) ■ 

This also shows that Si is differentiable for all i, since the 



derivative of etc is well-defined at all points and 



9a c (p) /-a , V dsi(p) 



dp t 



To see that Equation [7] implies Equation [H] suppose that 
Equation [7] holds. We know that the scoring rule s can be 
expressed as in Equation for some function G. For this 
particular G, 



N 



N 



<*oW = G(p) -Y^PjGM + Giip) = G(p) . 

i=l V 3=1 / 

Since G(p) — ctc(p) and etc is differentiable (meaning that 
dac/dpi is the only subderivative of etc with respect to pi), 
this implies Equation [8] 

We have established the equivalence between Equations [7J 
and We now show that a trader gets exactly the same 
profit for any realized outcome in the two markets if the 
market prices are positive. 

Suppose in the cost function based market a trader 
changes the outstanding shares from q to q' . This trade 
changes the market price from p(q) to p{q')- If outcome i 
occurs, the trader's profit is 

(ql - qi ) - [C(q') - C(q)) 

= (qi - qi) - (^Yl p i (q')i'j ~ a c(p(q'))j 

+ ^2pj{q)qi -ot C [p{q)) 
\j=i j 

= ( q>i ~ JlPj^')ij + a c(p(q'))j 

Qi - ^Pj(q)qi + ot C (p{q)) . (io) 

^ 3 = 1 / 

From Lemma U we know that p(q) is the optimal solution 
to the convex optimization maxp-gAjy (X/i=iP»<fc — a c(jp)^- 
The Lagrange function of this optimization problem is 

/ n \ N N 

L = I ^Pi«?i - a c {p) - A(^pi - 1) + ^2mpi. 

\i = l / ! = 1 1=1 

Since p(q) is optimal, the KKT conditions require that 
dL/dpi = 0, which implies that for all i, 



da c (p(q)) ,^ 



(11) 



where Hi(q) > and Hi{q)Pi(q) = 0. Plugging (ITT|) into (fTO)) . 
we have 

(qi - qi) - {C(q') - C(q)) 

ldac{p{q')) ^ ,^,.da c {p(q")) r-r-»\\ r-'i 

= t dp~m - 5 } ^(go Qc(p(g )} " } 



da c {p(q)) 
dpi(q) 



^2pAq) 



i=i 



dacjpjq)) 

dpj(q) 



otc(p(q)) - Hi(q) 



When pi(q) > and pj(<T) > 0, m(q) - fJ,i(q') = 0. In 
this case, the profit of the trader in the cost function based 
market is the same as that in the market scoring rule market 
when he changes the market probability from p(q) to p(q')- 
Finally, observe that using the cost function based mar- 
ket it is possible to achieve any price vector r with finite 
scores Si(r) by setting qi — s,(r) for all i. By Lemma 2] 
for this setting of q, p(q) is the vector p that maximizes 
~!2f = PiSi(r) — J2^ =1 PiSi(p). Since s is strictly proper, this is 
maximized at p = f. Since s is regular, this implies that it is 
possible to achieve any prices in the interior of the probabil- 
ity simplex using the cost function based market (and any 
prices p on the exterior as long as Si(p) is finite for all i). □ 

5.3 Convex Cost Functions and FTRL 

Consider a prediction market with a convex cost function 
represented as C(q) = sup- gAjv (X)i=i P^ ~ o(p)J and 
the corresponding learning algorithm with weights Wi,t = 
Pi(-eLt-i). (Recall that L t -\ = (Li lt _i, ■ • • , L N ,t~i) is the 
vector of cumulative losses at time t — 1.) By Lemma [4] 
the weights chosen at time t are those that maximize the 



expression -fE^i^iii.t-i - 
that minimize the expression 



a(w), or equivalently, those 



N 1 
WiLi t t-i H — ct(w) 



This expression is of precisely the same form as Equa- 
tion O with a playing the role of the regularizer and e con- 
trolling the trade-off between the regularizer and the em- 
pirical loss. This implies that every convex cost function 
based prediction market can be interpreted as a Follow the 
Regularized Leader algorithm with a convex regularizer! By 
applying Theorem [2] and Lemma [SI we can easily bound the 
regret of the resulting algorithm as follows. 

Theorem 4. Let C be any valid convex cost function 
yielding <f>-stable prices, and let etc be the penalty func- 
tion associated with C. Let A be the expert learn- 
ing algorithm with weights as in Equation PJ with e = 
y / 2sup J?i? , sAjv (ac(p) - a c {p'))/(4>T). Then for any se- 
quence of expert losses £i,t £ [0, 1] over T time steps, 



La,t — min Li.r < 
»€{!,■•• ,n} 



2T4> sup (ac(p) — ac(p')) ■ 



This bound is very similar to the bound for FTRL given in 
Equation |4j with (f> playing the role of A. 

The connections we established in the previous section 
imply that every strictly proper market scoring rule can also 
be interpreted as a FTRL algorithm, now with a strictly 
convex regularizer. Conversely, any FTRL algorithm with a 
differentiable and strictly convex regularizer can be viewed 
as choosing weights at time t to minimize the quantity 



} j Wi (eLi, t _i + Si(w)) 



= {si(P(q')) - fj-i(q')) - (si(p(q>) - m(q)) ■ 



(12) 



for a strictly proper scoring rule s. Perhaps it is no surprise 
that the weight updates of FTRL algorithms can be framed 
in terms of proper scoring rules given that proper scoring 



rules are commonly used as loss functions in machine learn- 
ing BOS an d FTRL has previously been connected to Breg- 
man divergence s l34l . [Tsl . \Vj\ which are known to be related 
to scoring rules [L2| |. 

This connection hints at why market scoring rules and 
convex cost function based markets may be able to obtain 
accurate estimates of probability distributions in practice. 
Both types of markets are essentially learning the distribu- 
tions by treating market trades as training data. Beyond 
that, both markets correspond to well- understood learning 
algorithms with stable weights and guarantees of no regret. 

5.4 Relation to the SCPM 

Agrawal et al. [l| present another way of describing convex 
cost function based prediction markets, which they call the 
Sequential Convex Pari-Mutuel Mechanism (SCPM). The 
SCPM is defined in terms of limit orders instead of market 
prices, but the underlying mathematics are essentially the 
same. In the SCPM, traders specify a maximum quantity 
of shares that they would like to purchase and a maximum 
price per share that they are willing to spend. The market 
then decides how many shares of the trade to accept by 
solving a convex optimization problem. 

Agrawal et al. [l[ show that for every SCPM, there is an 
equivalent convex cost function based market. For each limit 
order, the number of shares accepted by the market maker in 
the SCPM is the minimum of the number of shares requested 
by the trader and the number of shares that it would take 
to drive the market price of the shares in the corresponding 
cost function based market to the limit price of the trader. 
Thus our results imply that any SCPM mechanism can also 
be interpreted as a Follow the Regularized Leader algorithm 
for learning from expert advice. 

We remark that Agrawal et al. [l[ also describe an in- 
terpretation of the SCPM in terms of convex risk measures 
and suggest that the associated penalty function is related 
to the underlying problem of learning the distribution over 
outcomes. However, their interpretation is very different 
from ours. They view the penalty function as characterizing 
"the market maker's commitment to learning the true distri- 
bution" since it impacts both the worst case market maker 
loss and the willingness of the market maker to accept limit 
orders. On the contrary, we view the penalty function as a 
regularizer necessary to make the market prices stable. 

6. EXAMPLE: THE QUADRATIC MSR 
AND ONLINE GRADIENT DESCENT 

In the previous section we described the relationship be- 
tween market scoring rules, cost function based markets with 
convex cost functions, and Follow the Regularized Leader al- 
gorithms. We discussed how the Logarithmic Market Scor- 
ing Rule can be represented equivalently as a cost function 
based market, and how it corresponds to Weighted Majority 
in the expert learning setting. In this section, we illustrate 
the relationship through another example. In particular, we 
show that the Quadratic Market Scoring Rule can be writ- 
ten equivalently as a cost function based market (namely 
the Quad-SCPM of Agrawal et al. QJ). We then show that 
this market corresponds to the well-studied online gradient 
descent algorithm in the learning setting and give a bound 
on the regret of this algorithm using Theorem 2] 

The Quadratic Market Scoring Rule (QMSR) is the mar- 



ket scoring rule corresponding to the quadratic scoring func- 
tion in Equation [T] As was the case in the LMSR, the pa- 
rameters ai, • • • , ajv do not affect the prices or payments of 
this market. As such, we assume that ai — for all i. 

Theorem [3] implies that we can construct a cost function 
based market with equivalent payoffs to the QMSR when- 
ever prices are nonzero using the cost function 



C(q) 



sup ^£>i<7i - 2pi ~ 



N 



i = l 
N 



= sup ^Piqi - b^p* 

This is precisely the cost function associated with the Quad- 
SCPM market with a uniform prior, which was previously 
known to be equivalent to the QMSR when prices are 
nonzero 0]. The worst case loss of the market maker in 
both markets is b(N - 1)/N. 

Following the argument in Section \5. 31 this market corre- 
sponds to the FTRL algorithm with regularizer rj = 1/6 and 
lZ(w) = 53fc=i w i ■ ^ nas been observed that using FTRL 
with a regularizer of this form is equivalent to online gradi- 
ent descent [l9l . [l7j . Thus we can use Theorem [4] to show a 
regret bound for gradient descent. 

We first show that the Quad-SCPM prices are 0-stable for 
4>= (iV 2 -l)/(2b) < N 2 /(2b). (See AppendixOfor details.) 
We can therefore apply Theorem U using (/> = N 2 /(2b) and 
sup ?]? , eAjv (a(p) - a(p')) = b(N-l)/N < b to see that for 
gradient descent, 

Lgd,t — mm 

This matches the known regret bound for general gradient 
descent applied to the experts setting [35| . 



7. DISCUSSION 

We have demonstrated the elegant mathematical connec- 
tion between market scoring rules, cost function based pre- 
diction markets, and no-regret learning. This connection is 
thought-provoking on its own, as it yields to new interpre- 
tations of well-known prediction market mechanisms. The 
interpretation of the penalty function as a regularizer can 
shed some light on which market scoring rule or cost func- 
tion based market is best to run under different assumptions 
about traders. 

Additionally, this connection has the potential to be of 
use in the design of new prediction market mechanisms and 
learning algorithms. In recent years there has been an in- 
terest in finding ways to tractably run market scoring rules 
over combinatorial or infinite outcome spaces 0, [Tol . [f| . For 
example, a market maker might wish to accept bets over 
permutations ("horse A will finish the race ahead of horse 
B"), Boolean spaces ("either a Democrat will win the 2010 
senate race in Delaware or a Democrat will win in North 
Dakota"), or real numbers ("Google's revenue in the first 
quarter of 2010 will be between $x and $?/"), in which case 
simply running a naive implementation of an LMSR (for 
example) would be infeasible. As mentioned above, by ex- 
ploiting the connection between Weighted Majority and the 
LMSR, Chen et al. [(| showed that an extension of the 
Weighted Majority algorithm to permutation learning [2(| 
could be used to approximate prices in an LMSR over per- 



mutations. Given our new understanding of the connection 
between markets and learning and the growing literature on 
no-regret algorithms for large or infinite sets of experts [2lJ , 
it seems likely that similar learning-based techniques could 
be developed to calculate market prices for other types of 
large outcome spaces too. 
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APPENDIX 

A. PROOF OF LEMMA 2 

Fix the vectors q and r. Let u(s) = q + sr. Similarly to 
how we have defined Dij(t), define 



dpi(u(s)) 

Di{x) = { as I 




if 9p ' ("(")) is defined at x, 

OS ' 

otherwise. 



Using Di in place of the derivative allows us to integrate over 
the derivative even when it is not defined at single points. 
For any point at which the derivatives are defined, we have 

8C(u(s)) ^ dC(u(s)) duds) ^ t-tw 
ds ^ duds) ds 

Applying the fundamental theorem of calculus, we have that 

C(q + r) - C(q) 

f 1 dC(u(s)) f 1 ^ 

= / „ dx= I } pi{u(x))n dx 

Jo os s=x J , =i 

= / £ (pi{ti(0)) + J o Di{y)d^ r t dx 

N x x N 

= £^(9)^+/ / £nA(2/) dy dx . 



Rearranging terms, this gives us that 

C(q + r) - C(q) - YWflr, = / / £ r 4 A(») dy dx. 
i=l Jo Jo i=l 

To prove the lemma, it is sufficient to bound the absolute 
value of the expression on the right. This is where the in- 
stability of the prices comes into play. At any point where 
the derivatives are defined, 



£ 



dpi(u(s)) 



ds 



£ 



E 



<*££ 



dpi(q) 



< en 



Since we have assumed that the prices are piecewise diffcr- 
entiable this implies that 



1 r x N 

/ y) r iA(y) dy dx 
o Jo i=1 

1 x N 

< e f f ^2\Di(y)\ dydx 



< e 



o Jo 

1 rX 2 f 1 ^ $ 

e<f) dy dx = e <f> x dx — — — 
Jo 2 



This bounds the absolute value of the right hand side of the 
equation above and proves the lemma. □ 



B. PROOF OF LEMMA 3 



For every i and every j i, we have 



dqt 



d 



ie?i /fc E ^ =i e^-( e «/<>) 2 

\ {pi{q) -Pi{qf) =^i(g)E^® 



and 



dpr{q) 

dqj 



_ e <Ji/i> e 9j/iJ 



= --£Pi(q)Pi($) 



i/b b 



Thus we have 

JV JV 



EE 

i=l 3=1 



<9<7j 



E 



dpi(q) 

dqt 



N 

EE 

i=l j'^i 



9gj 



i=l j^i 

We would like to find the prices that maximize this quan- 
tity. Dropping the argument q to simply notation, this is 
equivalent to solving a simple optimization problem: 



maximize - 

pSA N o 



It is straight-forward to show (e.g., using the KKT condi- 
tions) that this expression is maximized when the prices are 
equal across all securities, so pt — 1/N for all i. Then 



EE 

i=i j=i 



dpM < 2 N(N-1) < 2 



dqj 



b N 2 



□ 



C. STABILITY OF QUAD-SCPM PRICES 

The cost function of Quad-SCPM can be written as 

/ N N 

C(q) = sup ^p l q l - bj^p- 

By Lemma U the price function of Quad-SCPM is defined 
by the optimal solution to the optimization problem in this 
function. The Lagrange function corresponding to the con- 
strained optimization problem is 

/ n N \ / n \ N 

L = ( E pi?i ~ & E^ ) _ A ( E pi ~ 1 J + E^ pi - 

\i=l t=l / \i=l / i=l 

Because the objective function is strictly concave in p, there 
is a unique optimal solution that satisfies the KKT condi- 
tions: dL/dpi = Vi, Ei=iP* = 1) Pt > Vi, /x, > Vi, 



and j-iiPi = Vi. Thus, the price function is defined by 



if e > 0, and 



* 1 iV 26 

Pi/ii = Vi 
P; > Vi 
> Vi. 



26JV 



Vi 



(13) 



When l/N + q t /(2b) - J2? = i nl&bN) > 0, m = and 
Pi > for all i and the price function is 

PiWJ - TV + 2 & 267V ' 
which is the same as that of QMSR. In this case, we have 



dqi 



1 

26 



1 

2bN 



Vi and 



dp t 
dqj 



1 

2bN 



Consider the case in which the prices for some outcomes 
equal 0. Given q, let M — {m : p m > 0, /Lt m = 0} be the set 
of outcomes that have positive prices. Let A — {k : p k = 
0, /ifc > 0} be the set of the outcomes that have a positive 
/ife. Let L = {I : pi = 0, fii = 0} be the set of outcomes for 
which both pi and m are 0. Denote p — min m6 MPm and 
p. = mhifcgA'Pfc. For < |e| < min{/2, 2bp(N — \K\ — \L\)}, 
we consider the following cases: 

• L = and i G M 

Consider changing q to q', where g- = qi + e and q'j = 
Qj Vj 7^ i. We can verify that the price function for q" is 
defined by 



Pi— fi ' \2b 
Pj = Pi ~ 2b(N-\K\) 
Pfc = Pfc = o 
Mj = Mi = o 
Mfc = Mfc + iv_ e |A-| 



2b(JV-|A"|) - 



Vj€ M,j^i 
Vfc g A 
Vj G M 
Vfc G A 



where p* and jl are the prices and Lagrange multipliers 
for q. Hence, by the definition of derivatives, we have 



dpi 
dqi 



1 



2b 2b(N-\K\) 



Vi G M 



dqi 



2b(N - |A'|) 



Vi,j G M,i^j 



dp k 
dqi 



o Vfceif.ieM 



• L 7^ and i G Af 

Consider changing (f to <f' , where q\ — qi + e and q'j 
<?j Vj 7; i. The new prices are defined by 



Pi -- 
P'i- 

A 
p'i-- 

i''k 

i 



Pi + ih 



■ 2b 2f>(JV-|JC|- 
: Pi ~ 26(JV-|K|-|i|) 

= p fe =0 
Pi = o 
= mj = o 

z Mfc + JV-|A'|-|L 
1 Mi + AT-|a|-|l| 



Vfc g A 
via 

Vj G M 
Vfc G A' 

vz g l 



2b(N- 





Pi = P> + (^ 
P'j = P3 

p'k = Pfc 
Pi = Pi 

Mj = Mj 
Ml- = Mfc + jv 
fi[ = /xj = 



26 2b(Ar_|A"|) 



1*1) 



2i)(JV- 
: 



|A|) 



I A' | 



Vj G M,jj=i 
Vfc G A 

VI eL 
Vj G Af 
Vfc G A" 
Vi G 1/ 



if e < 0. 

From the above, we can see that the prices of all out- 
comes are continuous while we changing qi in its e- 
neighborhood. However, dpi/dqi, dpj/dqi, and dpi/dqi 
are not defined at q for all i, j G A/, i 7^ j, and Z G L, 
because the left and right derivatives do not equal. We 
only have dpt/dqi = for all k G A and i G M. 

When changing q to q' , where q[ — qi + e and q'j — 
1i Vj 7^ h similar to the above case, the new optimal 
solution is different for e > and e < 0. In particular, 
when e > 0, 



Pi 



Pl + { 2k 



2b(N- \K\) 



Pi 



Pi 



2b{N-\K\) 



Vj G M U L such that j / I, 



Pi Vj, 



and p' k = pfc = Vfc G K. When e < 0, J 
Ml = Ml - e . and Mj = Mj Vj / fe. 

We can see that the prices of all outcomes are continuous 
while we changing qi in its e- neighborhood. But dpi /dqi 
and dpj/dqi are not defined at <f for all j € MUL, j ^ I, 
and Z G I/, because the left and right derivatives do not 
equal, dpk/dqi = for all k G K and Z G L. 
• ke K 

Consider changing q to q" , where g fc = gt + e and = 
9i Vj 7^ fe, the new prices are defined by p'j = pj Vj, 
Mfc — Mfc — e > an< i Mj = Mj Vj 7^ fc. Thus, we have 
dp k /dq k = Vfc G A, <9pj/dgfc = Vfc, j G A, and 
d Pj /dq k = Vj G A/, fc G A. 

The above shows that the price functions are continuous 
everywhere, but not differentiable everywhere. In particular, 
when L is not empty (i.e., given q there exists some outcome 
i such that both Pi(q) and fii (q) axe zero) , some of the partial 
derivatives are not defined at q. This corresponds to the 
second and third cases shown above. We further note that 
in these two cases, for any i € M and Z G L, any change 
in qi or qi will cause the set of L to become empty, because 
either p; or [i\ will become positive for all Z G L. This means 
that the prices are differentiable almost everywhere, with 
the only exceptions at finite number of points when L 7^ 0. 

Because < \K\ < N - 1, < A,*(t) < 1/(26) - l/(2bN) 
and -1/(26) < D i:j (t) < 0. We have 



AT AT 

i=l J=l 



1 

26 



1 

26iV 



+N(N-1) 



N 2 



26 



26 



□ 



